2006  CCRTS 


THE  STATE  OF  THE  ART  AND  THE  STATE  OF  THE  PRACTICE 


Taxonomic  and  Faceted  Classification  for  Intelligent  Tagging  and  Discovery 
in  Net-Centric  Command  and  Control 

The  publication  of  this  paper  does  not  indicate  endorsement  by  the  Department  of 
Defense  or  IDA,  nor  should  the  contents  be  construed  as  reflecting  the  official 
positions  of  those  organizations. 

Topics:  Cognitive  Domain  Issues,  Social  Domain  Issues,  C2  Experimentation 
Authors:  Dale  E.  Lichtblau,  Andrew  W.  Trice,  Steven  P.  Wartik 
Point  of  Contact:  Dale  E.  Lichtblau 
Institute  for  Defense  Analyses 
4850  Mark  Center  Dr. 

Alexandria,  VA  22311 
703-845-6683 
del@ida.org 


Abstract 

The  success  of  Net-Centric  Operations  and  Warfare  (NCOW)  depends  upon  the  ability 
of  net-centric  environment  (NCE)  users — both  human  and  automated — to  readily  dis¬ 
cover  useful  information  and  services.  Effective  discovery  requires,  in  turn,  good  seman¬ 
tic  metadata  “tagging”  (i.e.,  indexing  the  functions  of  the  services).  Good  tagging  reflects 
the  contextual  relationships  among  the  discoverable  artifacts.  It  derives  its  value  from  the 
soundness — and  intuitiveness — of  its  underlying  approach  to  infonnation  and  services 
classification.  Unfortunately,  classification  “soundness”  is  mostly  in  the  eye  of  the  be¬ 
holder,  particularly  for  services  that  can  be  deployed  for  many  different  purposes,  and  not 
all  necessarily  foreseen  by  their  initial  developers.  Ultimately,  therefore,  what  is  needed 
for  more  rapid  and  effective  tagging  and  discovery  is  a  services  classification  approach 
that  accommodates  multiple  perspectives  as  to  what  constitutes  a  natural  and  intuitive 
characterization,  plus  tools  that  enable  NCE  users  to  take  advantage  of  these  capabilities 
without  being  overwhelmed  by  the  sheer  multiplicity  of  different  classification  perspec¬ 
tives.  This  paper  presents  a  proposed  structure  for  the  semantic  metadata  that  we  believe 
will  facilitate  service  and  information  discovery  in  the  NCE,  and  will  easily  accommo¬ 
date  use  by  intelligent  agents. 
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1.  Introduction 

It  has  been  argued  before  in  [1]  “that  the  envisaged  net-centricity  in  future  warfare 
(command  and  control),  business  operations,  and  enterprise  management  is  dependent 
upon  a  robust  intelligent  assistance  capability  based  on  the  profuse  use  of  intelligent 
agents  throughout  the  [Global  Information  Grid’s  Net-Centric  Information  Environ¬ 
ment].”  This  paper  takes  that  argument  one  step  further,  arguing  that  the  success  of  Net- 
Centric  Operations  and  Warfare  (NCOW)  depends  upon  the  ability  of  net-centric  envi¬ 
ronment  (NCE)  users — both  human  and  automated — to  readily  discover  useful  informa¬ 
tion  and  services.  It  also  presents  a  proposed  structure  for  the  semantic  metadata  that  we 
believe  will  facilitate  service  and  information  discovery  in  the  NCE.  Effective  discovery 
requires,  in  turn,  effective  metadata  “semantic  tagging”  (i.e.,  tagging  that  describes  the 
function  and  meaning  of  services).1  Discovery  and  tagging  can  be  considered  opposite 
sides  of  the  same  coin:  successful  discovery  depends  on  good  tagging;  good  tagging  re¬ 
flects  the  contextual  relationships  among  the  discoverable  artifacts.  This  coin  thus  derives 
its  value  from  the  soundness — and  intuitiveness — of  its  underlying  approach  to  informa¬ 
tion  and  services  classification.  Unfortunately,  classification  “soundness”  is  often  in  the 
eye  of  the  beholder2,  particularly  for  services  that  can  be  deployed  for  many  different 


1  We  use  the  term  “semantic  tagging”  in  contrast  to  “syntactic  tagging”.  Syntactic  tagging,  as  implemented 
by  standards  such  as  WSDL  [2],  explains  how  to  invoke  the  service  or  the  service  point-of-contact,  and  so 
forth.  The  recent  draft  version  of  Department  of  Defense’s  Enterprise  Services  Strategy  (ESS),  [3],  under¬ 
scores  the  importance  of  meta-data  tagging  for  NCE  services,  but  it  does  not  offer  a  practical  approach  to 
effectively  “tagging”  what  a  tagged  service  is  suppose  to  do.  This  paper  attempts,  in  part,  to  address  that 
lack. 

2  A  simple  example  will  illustrate  the  essence  of  the  problem.  There  are  two  obvious  candidates  for  the  top¬ 
most  layer  of  a  services  taxonomy  for  DoD  services,  one  based  on  the  four  core  components  of  warfare 
(strike,  maneuver,  logistics,  and  force  protection)  and  one  based  on  the  high-level  functional  characteriza- 
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purposes,  and  not  all  necessarily  foreseen  by  their  initial  developers.  Ultimately,  there¬ 
fore,  what  is  needed  for  more  rapid  and  effective  tagging  and  discovery  is  a  services  clas¬ 
sification  approach  that  accommodates  multiple  perspectives  as  to  what  constitutes  a 
natural  and  intuitive  characterization,  plus  tools  that  enable  NCE  users  to  take  advantage 
of  these  capabilities  without  being  overwhelmed  by  the  sheer  multiplicity  of  different 
classification  perspectives. 

This  paper  describes  a  novel  approach  to  the  classification  of  net-centric  services  and  a 
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prototype  of  a  services  discovery  and  tagging  tool  that  implements  this  classification 
scheme.  The  prototype  demonstrates  the  feasibility  of  the  approach  and  illustrates  its 
value  in  delivering  the  promise  of  a  services-oriented  architecture  (SOA).  The  paper  is 
organized  as  follows.  Section  2  summarizes  the  basic  problem  this  paper  tackles.  Section 
3  describes,  in  general,  a  two-pronged  approach  to  the  classification  of  services.  Section  4 
describes  the  prototype  service  tagging  and  discovery  tool  we  developed  to  illustrate  the 
approach  and  to  demonstrate  its  feasibility.  Section  5  discusses  our  plans  for  extending 
the  prototype  in  two  basic  directions:  to  apply  it  to  the  semantic  classification  of  informa¬ 
tion,  and  to  enhance  its  intelligent  agent-based  features. 

2.  Problem  Statement 

Two  fundamental  questions  arise  in  connection  with  semantic  tagging  of  services.  One, 
why  go  to  the  trouble  of  using  a  structured  semantic  tagging  scheme,  rather  than  an  ad 
hoc  tagging  approach — or  no  tagging  at  all?  And  two,  if  we  accept  the  need  for  a  seman¬ 
tic  tagging  scheme,  what  kind  of — and  how  extensive — should  the  semantic  meta-data 
structure  be? 

Why  Use  a  Structured  Classification  Approach? 

Regarding  the  first  question,  some  writers  have  argued  that  structured  semantic  ap¬ 
proaches  to  tagging  content  that  utilize  taxonomies  and  their  (implicit  or  explicit  underly- 


tion  of  the  mission  of  the  department  (force  application,  force  support,  and  force  management).  The  imme¬ 
diate  question  becomes,  how  to  reconcile  these  separate  perspectives  to  each  other? 

3  Note  that  while  this  paper  is  focused  on  services ,  there  is  no  reason  why  the  approach  cannot  be  applied  to 
the  meta-data  tagging  and  subsequent  discovery  of  information  in  the  NCE  as  well. 
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ing)  ontologies4  are  greatly  overrated  [4,  5].  They  assert  that  Web  sites  or  systems  that 
use  ad  hoc  tagging  techniques  (e.g.,  the  Web  site  del.icio.us)  or  rely  on  full  text  indexing 
only  (e.g.,  Google  and  other  search  engines)  have  been  extraordinarily  successful,  to  the 
point  that  taxonomies  have  become  almost  irrelevant.  Specifically,  the  claim  is  that  the 
less  structured  approaches  provide  more  flexibility  and  power  for  content  discovery  with 
fewer  maintenance  issues  when  compared  to  a  structured  taxonomy  approach  to  navigat¬ 
ing  through  content  (similar  to  what  Yahoo  emphasized  in  earlier  years). 

We  acknowledge  that  such  less  structured  and  more  decentralized  approaches  to  discov¬ 
ering  content  (including  service  descriptions)  can  be  valuable,  but  also  argue  that  provid¬ 
ing  some  further  structure  in  the  tagging  approach  also  provides  key  advantages,  particu¬ 
larly  when  the  focus  of  discovery  is  narrowed  to  a  set  of  domain-specific  service  descrip¬ 
tions  and  metadata  rather  than  the  entire  Web.  Services  differ  from  information.  Services 
have  a  clearly  defined  function  and  operate  on  specific  kinds  of  content.  In  this  sense  they 
are  fundamentally  different  from  the  unstructured  web  pages  one  finds  using  Google  and 
similar  tools.  We  therefore  put  forth  three  arguments  for  using  structured  approaches  to 
classifying  services. 

First,  if  the  service  domain  is  at  least  somewhat  focused  and  domain  experts  can  usefully 
structure  it  through  the  careful  choice  of  terms  and  relationships,  this  structure  can  be 
browsed  and  enable  newcomers  to  the  domain  to  better  understand  it,  a  capability  not 
possible  through  ad  hoc  tagging  or  free  text  search.  Second,  less  formal  approaches  do 
not  capture  synonyms  or  concept  relationships,  while  taxonomies  can  easily  be  extended 
to  capture  this  important  information,  thus  potentially  improving  discovery  results 
through  “semantic  search”  tools  [6].  Third,  a  service  catalog  is  not  a  set  of  arbitrary  Web 
pages;  it  has  an  inherent  structure  in  terms  of  a  number  of  description  dimensions  (e.g., 
see  the  Service  Description  Framework  [SDF]  outlined  in  the  Enterprise  Service  Strategy 
[3]).  These  dimensions  of  structure,  including  any  semantic  dimensions  and  tag  values, 
can  be  used  collectively  to  attain  more  control  over  the  search  refinement  process  than 

4  We  define  an  ontology’  as  a  detailed  description  of  what  exists  in  a  domain  (of  interest),  including  the  rela¬ 
tionships  obtaining  between  those  entities.  We  view  a  taxonomy  as  a  “tree”  (i.e.,  hierarchy)  of  progressively 
more  specialized  concepts,  easily  obtained  by  selecting  all  of  the  “is  a”  relationships  from  a  given  ontology. 
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what  can  be  achieved  with  a  free  text  search.  Such  searches  place  a  higher  burden  on  the 
end  user  to  fonnulate  an  “intelligent  search”  through  the  manner  in  which  they  assemble 
the  search  terms,  as  compared  to  a  search  that  provides  a  variety  of  search  dimensions 
with  pre-selected  terms. 

What  structured  approach  for  Net-Centricity? 

So,  if  we  accept  that  a  structured  classification  approach  has  potential  advantages  such  as 
those  outlined  above,  the  challenge  becomes  one  of  creating  a  classification  strategy  that 
is  feasible  and  appropriate  for  the  NCOW/NCE  domain.  Among  the  key  characteristics 
(and  corresponding  requirements)  that  define  the  domain  are  the  following: 

•  Very  large.  Since  NCE  services  will  encompass  virtually  of  all  DoD’s  capabili¬ 
ties,  it  seems  clear  that  the  set  of  services  will  grow  very  large  over  time.  More¬ 
over,  individual  services  will  evolve  through  multiple  versions.  These  considera¬ 
tions  in  turn  imply  the  need  for  powerful  tools  that  enable  unsophisticated  users  to 
navigate  through  a  large  body  of  services  and  associated  classification  data  effi¬ 
ciently. 

•  Many  “federated”  communities  of  interest  (COIs).  Each  community  of  interest 
acts  as  its  own  interest  group  and  will  have  the  authority  to  semantically  “clas¬ 
sify”  services  (and  information)  in  any  way  that  is  most  appropriate  for  that  COI. 
In  such  an  environment,  a  classification  scheme  that  is  intuitive  to  one  user  may 
make  little  sense  to  another  who  does  not  belong  to  that  COI — and,  as  a  result,  be 
of  little  help  in  discovering  the  critical  information  or  net-centric  services  needed 
to  accomplish  a  mission.  This  implies  that  to  be  effective,  any  classification  ap¬ 
proach  must  accommodate  and  support  multiple  perspectives  on  the  classification 
services. 

•  Need  for  traceability  and  justification.  DoD  develops  and  maintains  a  large  body 
of  policies  and  procedures  that  service  developers  and  users  must  comply  with. 
Accordingly,  it  is  critical  that  the  service  classification  approach  aid  in  under- 


5 


standing  and  tracking  why  services  were  developed  and  what  specific  policies 
mandate  or  recommend  their  use. 

The  classification  approach  and  prototype  described  in  the  rest  of  this  paper  aim  to  begin 
addressing  these  concerns  and  requirements. 

3.  A  Two-Pronged  Approach  to  the  Classification  of  Services  in 
the  NCE 

We  propose  a  two-pronged  approach  to  the  classification  of  net-centric  services5  for  ser¬ 
vice  discovery  and  meta-data  tagging  in  the  NCE:  taxonomic  classification  and  faceted 
classification.  By  “taxonomic  classification”  we  mean  the  entry  of  the  name  of  a  service 
at  its  appropriate  place  in  a  traditional  hierarchical  taxonomy  that  systematizes  the  do¬ 
main  to  which  the  service  belongs.  A  text  messaging  service,  for  example,  could  be  clas¬ 
sified  as  a  messaging  service  within  a  larger  communications  service  sub-tree  of  a  more 
general  networking  tree  or  hierarchy  of  services.  See  Figure  1. 


Figure  1.  NCE  Service  Taxonomy  Fragment 

By  “faceted  classification”  we  mean  the  characterization  of  a  service  in  terms  of  a  set  of 
attributes  expressed  as  words  or  phrases.  We  call  the  faceted  classification  scheme  used 
in  the  prototype  (see  below)  the  “7  Ws”  scheme  because,  extending  the  familiar  who- 
what-when-where-why  paradigm,  there  are  seven  “W”  questions  used  to  classify  a  ser- 

5  Note  that  while  this  paper  is  focused  on  services ,  there  is  no  reason  why  the  approach  cannot  be  applied  to 
the  meta-data  tagging  and  subsequent  discovery  of  information  in  the  NCE  as  well. 
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vice.  We  chose  this  paradigm  as  a  very  pragmatic,  yet  powerful  approach  because  a  ser¬ 
vice  is  fundamentally  an  action,  and  in  natural  language  one  describes  an  action  using 
subject,  verb,  object,  and  adverbial  modifiers  reflected  in  the  facets  cited  above.  More¬ 
over,  our  approach  allows  for  other  reasonable  faceted  classification  schemes.  For  exam¬ 
ple,  one  might  consider  every  service  as  one  that  supports  a  phase  in  an  overall  manage¬ 
ment  process  (e.g.,  planning,  data  collection,  analysis,  decision-making,  execution,  moni¬ 
toring)  or  a  stage  of  Colonel  John  Boyd’s  OODA  (observation,  orientation,  decision,  ac¬ 
tion)  loop  [7], 

The  answer(s)  to  each  of  the  following  seven  questions — not  every  question  need  have  an 
answer — comprise  our  “faceted  classification”  of  an  NCE  service: 

1 .  Who  uses  the  service? 

2.  What  does  the  service  do? 

3.  On  what  does  the  service  act? 

4.  To  whom  is  the  service  generally  directed? 

5.  Where  is  the  service  used? 

6.  When  is  the  service  used? 

7.  Why  is  the  service  used? 

Note  that  the  second  question,  “ What  does  the  service  do?”  addresses  the  functionality  of 
the  service  per  se.  Accordingly,  it  should  mirror  a  service  taxonomy  that  appears  in  the 
strictly  taxonomic  part  of  our  classification  approach.  Note  also  that  the  third  and  fourth 
questions — the  third  and  fourth  facets — mirror  the  direct  and  indirect  objects  of  ordinary 
declarative  sentences  and,  depending  on  the  nature  of  the  verb  (or  action),  may  not  be 
applicable  to  every  service.  The  where  facet  is  intended  to  capture  the  typical  operational 
sphere  of  the  service,  while  the  when  facet  is  meant  to  highlight  the  usual  temporal  scope 
or  breadth  of  the  service,  for  instance,  near  real-time,  near-tenn,  long-tenn,  strategic,  etc. 
The  why  facet  would  be  particularly  useful  if  it  referenced  particular  DoD  directives,  in¬ 
structions,  guidance,  or  other  prescriptive  or  advisory  documents  that  could  put  the  ser¬ 
vice  into  the  larger  warfighting  context. 
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The  novelty  of  our  approach  is  two-fold.  First,  we  admit  the  appropriateness  of  having 
multiple  distinct  taxonomies  available  for  service  classification.  In  other  words  we  read¬ 
ily  admit  that  no  one  taxonomic  scheme  will  be  adequate  to  classify  the  large  variety  of 
services  likely  to  become  available  in  the  NCE.  Even  if  one  all-encompassing  taxonomy 
were  to  be  devised,  it  is  unlikely  that  everyone  within  the  Department  of  Defense  would 
accept  it.  Second,  the  possible  values  for  each  of  the  facets  used  in  the  faceted  classifica¬ 
tion  scheme  may  themselves  derive  from  a  taxonomy  appropriate  to  the  domain  of  that 
facet.  To  take  a  simple  example,  to  assign  an  appropriate  value  that  answers  the  question, 
“Who  is  the  (typical  or  most  likely)  user  of  the  service?”  we  offer  the  user  an  organiza¬ 
tional  taxonomy,  in  effect,  an  “organization  chart,”  from  which  the  most  likely  user  (or¬ 
ganization)  -  or  users  or  organizations  -  can  be  selected.  And,  as  in  the  case  of  the  purely 
taxonomic  dimension  of  our  proposed  approach,  multiple  taxonomies  will  most  likely  be 
available  from  which  facet  values  may  be  selected6.  A  variety  of  organizational  taxono¬ 
mies  should  be  available,  reflecting  different  organizational  levels  and  types  (e.g.,  bu¬ 
reaucratic,  functional,  etc.). 

Overall,  our  approach  is  to  take  advantage  of  as  much  structure  as  possible  that  is  already 
available  from  existing  (and  future)  taxonomies,  rather  than  attempting  to  invent  a  new 
language  of  our  own.  We  argue  that  this  approach  will  both  increase  the  chances  of  its 
adoption  in  the  community  and  offer  benefits  to  users  beyond  those  provided  by  a  free 
text  search  through  a  less  structured  service  catalog.  If  service  developers  and  users  have 
tools  of  sufficient  sophistication  to  navigate  and  apply  both  types  of  classification  struc¬ 
tures,  it  will  be  easier  to  understand  the  mission  domains  to  which  the  services  apply,  as 
well  as  to  browse  and  search  them. 

More  formally,  we  define  a  taxonomic  classification  of  a  service  s  with  respect  to  a  tax¬ 
onomy  T  as  follows.  Let  SC  be  the  set  of  all  service  classes  (i.e.,  nodes  of  a  service  tax¬ 
onomy  tree).  Let  S  be  the  set  of  all  services.  An  element  sc  of  SC  is  a  2-tuple 
(, SC '  c=  SC,  S'  c=  S )  such  that  sc  g  pow(5'C') ,  where  pow(7)  denotes  the  power  set  of  / 


6  A  service  can  also  be  classified  in  terms  of  facets  that  are  not  hierarchically  structured  like  a  taxonomy 
(e.g.  a  scalar  number,  a  date,  a  developer  name),  but  the  implementation  of  this  is  straightforward  and  not 
considered  here. 


(i.e.,  sc  is  the  root  of  a  tree)  and  (Vs  e  S'^—3(SC,  S')  e  pow(SC')  |  s  e  s)  (i  .e.,  a  service  is 

classified  at  most  once  in  a  taxonomy).  We  may  now  define  a  taxonomy  as  a  set  of  ser¬ 
vice  classes  SC  c:  SC ,  and  a  classification  of  a  service  as  a  three-tuple 
(T,sc  <=  SC,s  e  S) ,  stating  that  service  5  is  classified  as  a  member  of  service  class  sc  in 
taxonomy  T.  If  service  s  were — as  is  quite  likely — also  classifiable  within  another  service 
taxonomy  T'  =  SC"c;SC,  then  (T',sc  e  pow(SC"),s  e  S)  characterizes  s  in  tenns  of 

this  second  taxonomy:  Service  s  is  classified  as  belonging  to  some  service  class  sc  of  T' . 
We  envisage  an  indefinite  number  of  service  taxonomies  available  for  the  appropriate 
taxonomic  classification  of  services.  Most  of  these  service  taxonomies  will  be  devised  by 
COIs. 

In  the  prototype  we  use  (parts  of)  several  currently  available  service  taxonomies.  The 
Navy’s  Common  Services  Function  List  (CSFL)  provides  a  reasonably  well-organized 
and  documented  set  of  1,025  functions,  taxonomized  under  three  major  headings  (com¬ 
bat,  infrastructure,  and  business  support).7  The  actions  and  capabilities  taxonomies  of 
DoD’s  core  taxonomy  [8]  were  also  selected  to  illustrate  our  proposed  approach  in  the 
prototype. 

With  respect  to  faceted  classification,  the  formal  representation  of  the  classification  of  a 
service  s  in  tenns  of  a  faceted  classification  FC  can  be  defined  as  follows.  Suppose  FC 
has  a  set  of  n  facets  F  =  {FvF2,...,Fn} .  Remember  that  we  can  regard  each  facet  as  a 
taxonomy,  and  that  we  classify  a  service  using  zero  or  more  terms  from  each  facet.  We 
can  therefore  represent  the  classification  as  a  set  ,.s’)j  where 

fi&F  and  nf  e  pow(/j) .  For  instance,  within  the  “7  Ws”  scheme,  a  service  5  might  be 
classified  using  the  set: 

{(FWh0 ,  Dcfense_Otg,.s'),  (Fwho ,  Govcrnment_Org,.s’),  (FWherc ,  CONUS,  j)} 


7  The  CSFL  was  adopted  to  represent  the  services  architecture  within  DoD’s  Federal  Enterprise  Architec¬ 
ture  (www.feams.gov). 
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which  means  that  service  s  has  two  likely  categories  of  users,  and  is  used  in  or  has  a  geo¬ 
graphical  scope  of  the  continental  United  States. 

4.  The  Prototype 

Our  prototype  contains  functions  for  both  tagging  and  discovery  of  NCE  services.  Ser¬ 
vices  can  be  discovered  interactively  by  a  human  user  or  by  a  software  agent  that  invokes 
the  service  discovery  tool  as  a  service  in  its  own  right.  Services  can  be  discovered  by  ju¬ 
dicious  navigation  through  the  hierarchy  of  a  taxonomy,  by  facet-based  searches,  or  both. 
The  tool  is  also  designed  to  enable  a  service  producer  to  provide  the  other  meta-data  pre¬ 
scribed  by  the  DoD  ESS  (name,  textual  description,  developer,  etc.).  And,  of  course,  this 
additional  information  is  also  provided  to  the  user  who  is  using  the  tool  for  service  dis¬ 
covery. 

The  user  initiates  the  prototype  application  and  creates  or  opens  a  project.  Each  project 
consists  of: 

•  A  set  of  services  the  user  wants  to  classify.  This  set  may  be  empty  if  the  user  is 
only  interested  in  service  discovery. 

•  A  set  of  taxonomic  classifications  the  user  deems  relevant  for  classification  or  dis¬ 
covery. 

•  A  set  of  faceted  classifications  the  user  deems  relevant  for  classification  or  dis¬ 
covery. 

•  Classification  information  the  user  creates  that  relates  services  to  the  taxonomic 
and  faceted  classifications. 

Each  service,  taxonomic  classification,  and  faceted  classification  is  uniquely  identified  by 
a  Uniform  Resource  Identifier  (URI)  [9].  Currently  the  prototype  simply  asks  the  user  to 
enter  a  textual  URI.  Eventually,  we  envisage  a  single  and  logically  centralized  (albeit  rep¬ 
licated)  directory  of  available  services  that  the  tool  will  link  to  automatically. 

After  the  user  (or  agent)  opens  a  project,  the  prototype  displays  the  window  similar  to 
that  shown  in  Figure  2.  The  pane  on  the  left  contains  the  set  of  services  in  the  project;  the 
user  selects  a  particular  service  using  the  drop-down  list  in  the  upper  left.  The  rest  of  the 
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left  pane  shows  information  about  the  service.  This  information  includes  its  name,  a  tex¬ 
tual  description,  and  its  URL  We  also  foresee  the  need  for  information  on  how  to  invoke 
and  use  the  service,  plus  additional  information  DoD  might  mandate.  The  prototype 
should  incorporate  tools  to  help  users  create  this  information,  but  currently  it  simply  asks 
for  two  URIs:  first,  of  a  document  containing  a  Web  Service  Description  Language 
(WSDL)  specification  of  a  service’s  syntax  [2],  and  second,  of  a  document  containing  a 
DoD  Discovery  Metadata  Specification  (DDMS)  [10].  These  URIs  are  placeholders  to 
suggest  future  functionality. 

The  right  pane  is  split  into  two  sub-panes.  The  top  pane  contains  the  taxonomic  classifi¬ 
cations  in  the  project;  the  bottom  pane  contains  the  faceted  classifications.  The  user  can 
add  classification  schemes  and  services  to,  or  remove  them  from,  the  project  at  any  time. 

Figure  2  shows  how  the  user  can  classify  a  service  with  respect  to  a  particular  taxonomy. 


Figure  2.  Taxonomic  Classification  of  a  Service 
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Here,  the  user  has  indicated  that  the  Request  Medivac  Services  is  a  Logistical  Action  in 
the  DoD  Core  Actions  and  Functions  Taxonomies.  The  prototype  displays  this  fact  both 
graphically  (in  the  taxonomy  tree  in  the  upper  right  pane)  and  textually  (in  the  Service 
Taxonomy  Classifications  table  in  the  left  pane). 

Figure  3  shows  how  the  user  can  tag  a  service  using  terms  from  the  faceted  classification. 
The  user  has  chosen  the  “Military  forces  organization”  term  from  the  “Who”  facet  and 
the  “Protection”  tenn  from  the  “Why”  facet.  As  we  discussed  in  Section  3,  it  is  not  neces¬ 
sary  to  assign  terms  from  all  facets,  so  the  prototype  is  showing  a  valid  faceted  classifica¬ 
tion  of  the  Request  Medivac  Services. 

These  two  screenshots  have  illustrated  some  of  the  actions  a  user  may  take  to  classify  a 


Figure  3.  Faceted  Classification  of  a  Service 
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service.  The  user  may  also  extend  the  classification  hierarchies,  adding  new  service 
classes  to  a  taxonomy  and  new  subfacets  to  a  faceted  classification.  The  prototype  also 
allows  the  user  to  define  synonyms  in  the  faceted  classification  so  as  to  increase  the 
probability  that  a  search  by  facets  will  yield  a  result. 

The  prototype  currently  saves  classifications,  extensions,  and  synonyms  that  a  user  de¬ 
fines  in  a  local  file.  Ultimately,  the  tool  would  help  the  user  publish  the  information  in 
some  globally  accessible  location.  In  this  way,  the  user  would  propagate  semantic  infor¬ 
mation  about  a  service. 

The  prototype  supports  discovery  as  well  as  tagging.  Figure  4  provides  an  example.  The 
user  has  searched  known  taxonomies  (those  displayed  in  the  upper  right  pane)  for  ser- 


Figure  4.  Service  Discovery  through  Taxonomies 
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vices  whose  names  contain  the  word  “logistics,”  and  the  application  has  found  and  dis¬ 
played  ten  such  services.  The  prototype  permits  the  user  to  add  these  services  to  the  left 
pane  for  further  examination  and  perhaps  additional  classification.  In  this  way  the  user 
can  continue  to  add  semantic  metadata  to  a  service. 

The  prototype  also  supports  searches  using  faceted  classification  terms.  This  is  a  con¬ 
trolled  vocabulary  search  in  which  the  user,  presented  with  a  fixed  set  of  terms,  chooses 
the  subset  of  them  that  seem  most  relevant  to  his  needs.  Figure  5  shows  an  example.  The 
user  has  performed  a  search  using  two  terms:  “Intelligence  organization,”  from  the 
“Who”  facet,  and  “Theater,”  from  the  “Where”  facet.  The  prototype  has  listed  all  services 
that  are  used  by  intelligence  organizations  and/or  operate  in  theater.  Once  again  the  user 
can  now  examine  each  discovered  service  more  closely  and  perhaps  further  classify  it. 


Figure  5.  Service  Discovery  by  Faceted  Classification 
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The  currently  implemented  searching  capabilities  are  basic.  A  real-world  application 
would  let  the  user  choose  between  conjunction  and  disjunction  of  terms,  for  example. 
However,  the  prototype’s  capabilities  serve  to  demonstrate  our  vision  of  classification, 
discovery,  and  their  interconnectedness. 

5.  Summary  and  Plans  for  Future  Work 

In  summary,  we  presented  an  NCE  service  classification  approach  and  described  a  proto¬ 
type  tool  that  promises  to  make  more  effective  use  of  NCE  services.  The  approach  is 
powerful,  yet  flexible,  and  is  consistent  with  the  federated  COI  perspective. 

In  tenns  of  future  work,  we  hope  to  demonstrate  how  the  prototype  can  be  invoked  by  an 
intelligent  software  agent  to  automatically  return  the  name  of  (and  eventually  invoke)  a 
service  that  has  been  “discovered”  as  sharable  within  the  NCE.  In  addition,  we  will  ex¬ 
plore  implementing  an  unsupervised  learning  algorithm  that  enables  the  prototype  to  of¬ 
fer  users  candidate  selections  based  on  its  tracking  of  the  general  user  community’s  most 
successful  historical  search  (navigation)  experience.  Not  only  will  this  approach  facilitate 
efficient  discovery  and  good  service  tagging,  it  will  also  allow  NCE  managers  to  easily 
determine  which  taxonomies  and/or  faceted  classification  schemes  have  little  if  any  value 
to  the  NCE  community  due  to  minimal  use  by  the  larger  community  of  users. 
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Next  steps 


Motivation 


I  mportance  of  discovery  and  tagging  in  the  NCE I 

INFORMATION  TECHNOLOGY  &  SYSTEMS  DIVISION 


Assertions: 

1)  Success  of  NCOW  depends  upon  ability  to  readily  discover 
useful  information  and  services  in  the  NCE 

2)  Effective  discovery  depends  on  good  semantic  tagging 
S)  Good  semantic  tagging  must  be  sound  and  intuitive 

4)  But,  “sound”  and  “intuitive”  are  in  the  eye  of  the  beholder 

Problems: 

1)  Howto  resolve  or  finesse  the  conflicting  perspectives? 

2)  How  to  provide  tools  that  support  NCE  user  without 
overwhelming? 
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Problem  Statement 

NCE  services  classification 
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Why  use  a  structured  classification  approach? 

□  Some  claim  taxonomies  and  ontologies  are  (almost) 
irrelevant 

•  Why  not  just  Google  everything? 

□  S  responses 

•  Structured  knowledge  facilitates  domain  understanding 

•  Structure  facilitates  automated  search  and  associated  tools 

•  Service  catalog  has  inherent  structure  that  can  be  leveraged 

What  structured  classification  approach  for  net- 
centricity? 

□  Requirements 

•  Accommodate  very  large  collection  of  services 

•  Encompass  many  federated  COIs 

•  Provide  traceability  and  justification  for  services 

□  Conclusion:  one  monolithic  approach  will  not  work 

•  Lack  of  agreement 

•  Diversity  of  communities  and  requirements 
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i  wo- pronged  uassmcation 

Approach  j  f)/j 

(Taxonomic  and  Faceted) 


Taxonomic  classification  Faceted  classification 


“Traditional”  approach 

Based  on  concept  hierarchy  with 
rules  aggregating  unique 
distinguishing  features  of  item 

Applies  more  to  the  “essence”  of  the 
item 

Can  place  item  at  0-1  nodes  in  a 
particular  taxonomy 

There  can  be  multiple  taxonomies 


More  recent  approach 

Based  on  “facets”:  categories  isolating 
useful  perspectives  on  an  item 

Applies  more  to  specific  properties  of 
the  item 

Assigns  0-n  defined  values  per  facet 
to  an  item;  can  also  add  synonyms 

There  can  be  multiple  facet  schemes 


Key  Concepts, 

Classification 

Rules 


Top 

0 

< 

0 

“Leaf  Node” 


Classified  Item 


Facet  1,  Facet  2,  ...  Facet  n, 

Term/Keyword  X  Term/Keyword  Y  Term/Keyword  Z 


Collection  of  Properties 
Classified  Item 


5 


Proposed  Strategy  for  NCE  Services  Classificationj*^^^ 
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Apply  both  approaches,  as  follows: 

Taxonomic  classification 

□  Use  to  position  services  within  communities  of  interest 

□  Use  as  basis  for  browsing  and  comparing  services 

□  Incorporate  “standard”  DoD  taxonomies  based  on  user  and  COI 
demand 

□  Can  be  used  to  enable  automated  reasoning  through  hierarchical 
structure 

Faceted  classification 

□  Use  facets  to  support  a  structured  tagging  approach 

□  Use  to  improve  searches  and  refinements  to  searches 

□  Synthesize  a  proposed  high-level  faceted  classification  system  for 
NCE  Services 

Based  on  fundamental  categories 

Incorporate  other  DoD  knowledge  and  terminology 
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Taxonomic  Classification 

"Standard"  DoD  Taxonomy  List 
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DoD  Core  Taxonomy* 

CSFL  (Common  System  Function  List)* 

Army  Battlespace  Command  Knowledge  System 
(BCKS)  Reference  Taxonomy 

Geospatial  Services  Taxonomy 

USAF  Core  I  nformation  Taxonomy 

NCTC  Architecture  Reference  Model  Services 
Taxonomy 


*Used  in  the  prototype 
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Taxonomic  Classification 

DoD  Core  Taxonomy 
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Background  and  status 

□  “Baseline  taxonomy  for  NCES  discovery  capability  [services  or 
information]” 

□  Developed  by  MITRE  et  al. 

□  Submitted  to  DoD  Metadata  Registry  1/11  /2005 


Description  and  structure 

□  Class/subclass  hierarchy  (214  classes) 

□  Each  class  is  a  concept  of  interest  to  DoD 

□  Each  class  has  a  (textual)  description 

□  Each  description  has  a  source  (e.q.,  GAO/AFMD2.1 .1 , 
lnvestorWords.com,  Merriam-Webster) 
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DoD  Core  Taxonomy:  Structure  ID A 
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DoD  Core  Taxonomy 

- Account - 

- Action - 

- Agreement 

- Asset - 

- Capability 

- Environment 

- Event 

- Function  — 

- Guidance 

- Interval 

- Location 

- Organization 

- Person 

- Role 


Accounting 

Federal-Fund 

Nonfederal 

Settlement 


Support 


- Military 

—  Acquisition 

—  Budgetary 

—  Environmental 


Battlespace  Awareness 
C2 

Force  Application 


Intangible-Asset 

Tangible-Asset 

Defense  and  National  Security 


Environmental  Management 
Health 


Resource  Management 


Services  Support - ...  _  , 

Branches  best  aligning 

with  services  in  bold 
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Taxonomic  Classification 

CSFL  (Common  System  Function  List) 
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•  Background  and  Status 

□  “System  functions  and  associated  definitions”  supporting 
all  aspects  of  Combat,  Infrastructure,  and  Business 
activities 

□  Developed  by  the  Department  of  the  Navy 

□  Can  be  basis  for  service  components  in  DoD  EA  SRM 

□  A  well-written,  large  collection  of  system  functions 

•  Description  and  structure 

□  Each  function  has  a  name,  a  description,  and  a  domain 
(and  sub-domains) 

□  Leaf  nodes  can  be  cast  as  potential  invokable  services 
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CSFL Taxonomy:  Structure  ID A 
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CSFL 


Combat 


—  Sense - 

—  Command 

—  Act 

—  Interoperate 


Sense  Sub-Domain 


Active  Search 
Air  Search 


Data  Fusion 


Analyze  Node  Assignments 
Characterize  Sensors 


Track 


Assign  Track  Category 
Identify  Civilian  Air  Tracks 


Infrastructure 


Sustainment 
IT  Infrastructure 


Business 


Logistics 

Human  Capital  or  WorkForce  Management 
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Faceted  Classification:  "7  W's"  Framework 
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Who  uses  the  service 

□  Notion  of  the  service’s  client  or  invoker 

□  Could  also  include  service  developer  or  “distributor” 

MTiafthe  service  activity  is 

□  Verb  denoting  the  activity 

On  What 

□  Notion  that  the  service  must  act  on  an  input  or  object;  tied  to 
the  service  domain 

To  Whom 

□  Covers  case  where  object  is  a  person 

When  the  service  occurs  or  has  an  effect 

□  Typically  a  temporal  performance  measure 

Where  the  service  applies  or  has  an  effect 

□  Could  be  geographic  (e.g.,  CONUS)  or  conceptual 
(e.g.,  relative  to  a  battlespace) 

Whyxhz  service  is  used 

□  Reference  to  authorities,  responsibilities,  regulations,  gui^nce 


Taxonomic  Classification  of  a  ServiceD/I 
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Faceted  Classification  of  a 
Service 
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*  Service  Classifier:  NCOW  Services  Classification 

1  File  Project 

Services 


Request  Medrvac  Service 


Service 

Name: 


Request  Medrvac  Service 


Description:  This  service  can  provide  medical  evacuation  of 
wounded  personnel. 


URI:  http:OTricka.itsd. ida.org:8080/sc/medivac-service.xml 

WSDL  URJ:  http:OTricka.itsd. ida.orq:8080/sc/my-wsdl.xml 
DDMS  URI:  http:OTricka.itsd.ida. org:8080/sc/my-ddms.xml 

Classifications 

Service  Taxonomy  Classifications 


Taxonomy 


DoD  Core  Taxonomy  Action.. 


Path 


Action->SupportAction->Lo.. 


Faceted  Classifications 


Focus 

From  Faceted  Classification 

Military  forces  organization 

7  W's  Classification 

Protection 

7  Ws  Classification 

View 


Delete 


Taxonomic  Classifications 


0  Common  System  Functions  List  (CSFL)  Taxonomy  r 


13  Common  System  Functions  List  (CSFL)  Taxonomy 

I 


Search  Taxonomies... 


Faceted  Classifications 


FI  7  Ws  Classification 


- 


□  D 


13  7  W's  Classification 
?  Who 

?  t  DoD  Core  Taxonomy  for  Organization 
o-  *  Governmental  organization 
f  *  Defense  organization 

— O  Military  forces  organization  (T) 

— O  Civilian  forces  organization 
®-  Nongovernmental  organization 
°-  What 
o-  On  What 
o-  To  Whom 
o-  When 
o-  *  Where 
-  Why 

o-  *  Requirement 
°-  Objective 
t  *  Capability 

o-  *  Business  Capability 
?  Military  Capability 

— O  Battlespace  awareness 
— O  Battlespace  communications 
— O  Focused  logistics 
— O  Force  application 
—O  protection  (T) 
o-  Intelligence  Capability 

o-  Enterprise  Information  Environment  Capability 


Search  for  Services... 
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Service  Discovery  through 


Taxo  nomies 


YSTEMS  DIVISION 


ID A 
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Service  Discovery  by  Faceted 
Classification 
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=  Service  Classifier:  NCOW  Services  Classification 

I  File  Project 

1  Services 

Taxonomic  Classifications 

Requ 


Sen 

Nan 

Des 


UR): 

WS 

DDfJ 

Cl; 

S 


Available  Terms 


@  Faceted  Classification  Search 


Search  Terms 
Facets 
O  Who 
O  What 
O  On  What 
O  To  Whom 
O  When 
<§>  Where 
O  Wily 


Selected  Terms 


a* 

tone 


Global 

In-transit 

OCONUS 

Operational 

POD 

POE 

Regional 

Strategic 

Tactical 

Theater 


Intelligence  organization 
Theater 


□  Ignore  Case 


Start  Search 


Discovered  Services 


Analyze  CONUS  and  Theater  Material  Handling  Equipment  Availability 


Analyze  Theater  Assets  Entry/Exit  Visibility 


Analyze  Theater  Commercial  Supply  Availability 


Analyze  Theater  Engineering  Infrastructure  Capability  and  Limitation 


Name 


Analyze  Theater  Engineering  Supportability 


Analyze  Theater  Engineering  Units  Visibility 


Analyze  Theater  Facility  Requirements 


Analyze  Theater  Host  Nation/Commercial  Engineering  Asset  Availability 


Analyze  Theater  Host  Nation/Commercial  Medical  Capabilities 
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Summary  ID A 

INFORMATION  TECHNOLOGY  &  SYSTEMS  DIVISION 


Discovery  and  tagging  in  the  NCE  are  critical 

Combining  taxonomic  and  faceted  classification  is  a 
promising  approach  for  improving  discovery  and 
tagging 

□  Supports  multiple  constituencies 

□  Is  intuitive 

□  Is  standards-based 

□  Improves  search 

Prototype  functionality  demonstrated 
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Next  Steps 

INFORMATION  TECHNOLOGY  &  SYSTEMS  DIVISION 

I  ntegrate  with  intelligent  software  agent 

Learning  algorithm  for  suggesting  appropriate  tags 

Facilitate  evolution  of  taxonomies  and  tags  for  NCE 
Services 
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